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Abstract. We study the statistical properties of the Luminous Red Galaxies sample from the Sloan Digital Sky 
Survey. In particular we test, by determining the probability density function (PDF) of galaxy (conditional) counts 
in spheres, whether statistical properties are self-averaging within the sample. We find that there are systematic 
differences in the shape of the PDF and in the location of its peak, signaling that there are major systematic 
effects in the data which make the estimation of volume average quantities unreliable within this sample. We 
discuss that these systematic effects are related to the fluctuating behavior of the redshift counts which can be 
originated by intrinsic fluctuations in the galaxy density field or by observational selection effects. The latter 
possibility implies that more than 20% of the galaxies have not been observed and that such a selection should 
not be a smooth function of redshift. 
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1. Introduction 



The sample o f 



Lum inous Red Galaxies (LRG) 
( Eisenstein et al.l . 2001) from th e Sloan Digital Sky 
Survey (SDSS) (jYork et al.l . |2000[ ) is considered to be 
the best sample to study galaxy large scale structures. 
One of the main features observed in the LRG sample 



from the final data release of the SDSS (jAbazaiian et al 
l2009f ). is that the number density as a function of redshift 
n{z), usually called the "selection function", presents a 
complex behavior. Given that, by construction, the LRG 
sample should be y o lume-limited (lEisenstein et al. . 2001 



the data, other than the behavior of n{z) itself. Indeed 
a different possibility is that the behavior of n(z) is 
determined by intrinsic fluctuations in the distribution of 
galaxies and not by selection effectfl 

We note that if (unknown) selection effects are consid- 
ered to play a major role in this sample, the question that 
should be addressed concerns the qua U ty of the data. In 
the original paper by lEisenstein et al. I (|200lll it is stated 
that the sample is volume-limited up to z 0.38, modulo 
minor effects due to K-l-e corrections (i.e. K-corrections 
and luminosity evolution). To support that this is the case, 
n{z) was found, in the early SDSS LRG data ( see their 



Zehavi et all 120051: iKazin et all I2OIOI ) the behavior of 



Figs. 1 2-13), to be close to flat. However Ei senstein et al 



n{z) is expected to be constant if galaxy distribution 
is close to uniform (up to Poisson noise and radial 
clustering). It is instead observed that the LRG sample 
n{z) shows an irregular and not constant behavior. 
An explanation that it is usually giv en for this result 
( Zehavi et all . 120051 iKazin et al] . l2010ll . is that the LRG 
sample is "quasi" volume limited, in that it does not 
show a constant n{z). Thus, the features in n{z) are 
absorbed in the properties of a selection function, which 
is unknown a priori, but that it is defined a posteriori 
as the difference between an almost constant n(z) and 
the behavior observed. Clearly this explanation is unsat- 
isfactory as it is given a posteriori and no independent 
tests have been provided to corroborate the hypothesis 
that an important observational selection effect occurs in 



( 20051 ). by considering a much larger LRG sample, noted 
that the comoving number density displays a close to 
a constant behavior, "although fluctuations are reaching 
about the 30% peak-to-peak" (see their Fig.l). More re- 
cently, by considering the LRG sample with the same se- 
lection in magnitude an d redshift as the one considered by 
Eisenstein et al.l ( 20051 ). but from the final S DSS data re- 
lease which is larger by a factor ~ 3 in volume, Kazin et al.l 
(2010) (see their Fig.l) noted that there are non-negligible 
variations in n(z), that, as mentioned above, make diffi- 



^ Note that for z > 0.36 there is instead a clear selection 
effect due to the "passa ge of the 4000 A break into the r 
band" (|Kazin et al.l . l2O10l ). This is shown by a smooth redshift- 
dependent decrease of the redshift counts. For this reason we 
limit our analysis to z = 0.36. 
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cult to consider this sample as a purely volume limited 
one. 

In this paper we determine the probability density 
function (PDF) of conditional galaxy counts in spheres q 
In practice we study the statistical properties of the vari- 
able Ni{r), which is defined to be the number of galaxies 
in a sphere a radius r centered on the i*'' galaxy. This 
has the advantage to provide a characterization of condi- 
tional fluctuations which is not affected by the geometri- 
cal constraints of the sample as 'n(z), which instead ex- 
plores different volumes at different redshifts. In addition, 
by measuring the PDF in different regions of the sample 
we can make a clear test to check whether fluctuations are 
self-averaging inside the sample. This is the fundamental 
property that fluctuations are required to have when an 
average quantity is measured in a given sample. 

Indeed a spatial average is meaningful, in the sense 
that it provides an estimation of the ensemble average 
property of the given statistics, only when fluctuat i ons are 



stationary inside the samp l e (ISvlos Labini et al.l . [2009a; 



Svlos Labini and Barvshev . 2010[ ). The breaking of self- 
averaging properties can be generally due to two different 
reasons. On the one hand when a distribution is not statis- 
tically translational and/or rotational invariant, then the 
fluctuation properties depend on the specific position of 
the volume in which they are measured. For instance, if 
the distribution is spherically symmetric around a point, 
then the relevant role is played by the distance R from 
the center: fluctuations at different distances R have dif- 
ferent properties and for this reason any volume average 
quantity, computed in regions large enough so that sta- 
tistical properties (i.e., the local density) change signifi- 
cantly, do not give an useful information about the real 
properties of the distribution. This same situation occurs 
when a distance-dependent observational selection effect 
occurs in the data as it can mimic the break of transla- 
tional invariance. 

On the other hand self-averaging properties can be bro- 
ken when fluctuations, in a given samp le, are too extended 



in spac e and have too large amplitude (jSvlos Labini et al 
l2009allbh . For instance when there are a few large scale 
structures in given volume, or even a single one, which 
dominates the distribution then it is not possible to get 
a meaningful estimation of an average quantity at large 
enough scales in the sample. This occurs precisely be- 
cause the sample volume is not large enough to aver- 
age between different large enough structures. This is a 
systemati c effect that some t imes is refereed to as cosmic 
variance (|Yang and Saslawl I2OIOI) but that IS more ap- 



ance (which involves already the computation of an av- 
erage quantity) maybe without statistical meaning in the 
circumstances described above. 

The breaking of self-averaging properties was found to 
occur in several volume-limited sample s of the main galaxy 



sampl e (MGS) of the DR6-SDSS (ISvlos Labini et al 



2009al ). Indeed, it was found that at large enough scales, 
i.e. r > 30 Mpc/h, the PDF of conditional fluctuations 
shows systematic differences when it is measured in vol- 
umes located in different positions in the sample. That 
this corresponds to the breaking of self-averaging prop- 
erties and not the the breaking of statistical isotropy 
and/or homogeneity (for intrinsic or observational rea- 
sons) is shown by two facts. The first is that the difference 
between the PDF, measured into two volumes located at 
small and high redshifts, is not systematically the same. 
That is, sometimes the PDF measured at low redshift 
is shifted toward smaller val ues and sometimes toward 
higher values. As discussed bv lSvlos Labini and Barvshev 
([2010) the breaking of statistical translational invariance, 
as well as the effect of a redshift dependent selection ef- 
fect, would be signed by a difference in the PDF at low 
and high redshift always in the same direction. Secondly, 
when th e larger volunies pr ovided by DR7-SDSS was con- 
sidered (lAntal et all . l2009l ). the PDF did not show the 
same difference: this is a clear indication that the breaking 
of self-averaging properties is due to a systematic volume- 
dependent effect. In addition, when self-averaging proper- 
ties were found to be satisfied the PDF was found to be 
nicely fitted by a Gumbel function and its first and second 
moments, the average conditional density and the condi- 
tional variance, were found to have a scaling behavior as 
a function of distance. All these behaviors mark a c lear 
departure from spatial uniformity ( Antal et al. . 20091 ). 



In this paper we perform the test for self-averaging in 
the LRG sample. In Sectl^we recall the basic statistical 
elements which we use in the data analysis. The main fea- 
tures of the data are presented in Sectl3]while in Sect|3]we 
show our results. We discuss our results in Sect IS] by com- 
paring the results obt ained here with the one s measured 
in the SDS S-MGS bv [ Lovedavl (l2004 : ISvlos L abini e£aD 
(|2009al lbh: lYang and Saslawl (120101 ) and in the 2 de- 
gree field redshift survey (2dFGRS) bv lSvlos Labini et al 



|2009c,d). In addition we critically consider t he measure- 



ments of the conditional average density by iHogg et al 



prop riately defined as breaking of self-averaging proper 



( 20041 ). the assumptions entering in the determinatio n of 
the two-point correlation function ( Kazin et al. . 2010t) and 
the problem underlyir ig the determination of the so-called 
baryon acoustic peak ( Eisenstein et al.', ^ 2005t Kazin et al 



20 lot iMartmez. et al 



2009; Svlos Lab ini et all . |2009e) 



ties (ISvlos Labmi et al.l . 12(3 . as the concept of vari- finally we draw our conclusions in Sectji 



^ Conditional properties are local, and they do not require 
the measurement of global statistical quantities on the sam- 
ple scale. They are well defined both for spatially uniform 
and inhomogeneous distributions, while unconditional statisti- 
cal quan tities are well defined only for spatially homogeneous 
systems (|Gabrielh et all . I2OO5I ). 



2. Statistical background 



As it was discussed by ISvlos Labini et al" (l2009allbl ): 
Svlos Labini and Barvshev ( 2010() a simple test to deter- 



mine whether a point distribution is self-averaging in a 
given sample of linear size Rs consists in studying the PDF 
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of conditional galaxy counts in spheres M (which contains, 
in principle, information about moments of any order) in 
sub-samples of linear size r < Rs placed in different and 
non-overlapping spatial regions of the sample (that we call 
Si, S2, ---Sn)- That the self-averaging property holds is 
shown by the fact that Ps^ {Af; r) is the same, modulo sta- 
tistical fluctuations, in the different sub-samples, i.e., 



Fs.(A/';r) «Ps,(A/-;r) V*^j 



(1) 



On the other hand, if determinations of Ps; (A/", r) in 
different sample regions Si show systematic differences 
then, as discussed above, the distribution is not self- 
averaging because of the presence of non-averaged large 
scale structures or because it not statistically transla- 
tional and/or rotational invariant (this includes the case 
in which a redshift dependent observational selection ef- 
fect is present). When EqU] is not found to be satisfied 
in a given sample then the determinations of the spatial 
averages are sample-dependent implying that those sta- 
tistical quantities do not repre sent the asymptotic prop- 
erties of the given distribution (ISylos Labini et al. . 2009at 
Svlos Labini and Barvshev , 2010l ). 

To test self-averaging is necessary to employ statis- 
tical quantities that do not require the assumption of 
spatial homogeneity inside the sample and thus avoid 
the normalizatio n of fluctuations to the est imation of the 
sample average (jSvlos Labini et al.l . l2009al ). We therefore 



consider the PDF of the stochastic variable defined by 
number of points Ni(r) contained in a sphere of radius 
r centered on the point. This depends on the scale 
r and on the spatial position of the i*^^ sphere's cen- 
ter, namely its radial distance Ri from a given origin 
(in this case the Earth) and its angular coordinates oti. 
Integrating over for fixed radial distance Rj. we ob- 
tain that N (r) = N(r:RA (I Svlos Labini et all. l 2009a : 



Antal et al. 



In Antal et al 



2009 : lSvlos Labini and Baryshev, 2010). 



(|2009[ ) we showed that the Gumbel dis- 
tribution was a good fit to the data of the SDSS-MGS, 
marking a clear departure from the Gaussian behavior ex- 
pected for spatially uniform systems. The Gumbel dis- 
tribut ion's PDF, at fixed r, is given by (see lAntal et al, 
(j2009l) and references therein) 



P{N) = iexp 



exp 



N 



(2) 



The mean and the standard deviation (variance) of the 
Gumbel distribution (Eql2]), depend on the scale r, and 
are given by 



(3) 



where 7 = 0.5772 ... is the Euler constant. 
3. The data 

The sample we consider in this paper is one of the samples 
prepared bv lKazin et al. IdloiS). In particular we focus on 
the so-called DR7-Dim, while we do not present results 



for the other samples, as the one denoted as bright that 
contains too few galaxies for a reliable statistical analysis. 

The metric distance i?(z; i7„i, fiA) from the redshift 
has been computed by using the cosmological parameters 
with values VL„i = 0.25 and i^A = 0.75. 

Given that we exclude redshifts z > 0.36 and z < 0.16, 
the distance limits are: Rmin = 465 Mpc/h and Rmax = 
1002 Mpc/h. The limits in R.A a and Dec. S considered 
are chosen in such a way that (i) the angular region does 
not overlap with the irregular edges of the survey mask 
and (ii) the sample covers a contiguous sky area. Thus we 
have chosen: amin = 130° and a,„aa; = 240°; Smin = 0° 
and Smax — 50° . The absolute magnitude is constrained 
in the range M g [-23.2, -21.2]. With these limits we find 
A^ = 41833 galaxies covering a solid angle ft = 1.471 sr. 

4. Results 

As discussed in Sect 12] we study the statistical properties of 
the random variable Ni{r) measuring the integrated num- 
ber of points in a sphere of radius r around i ~ l...A/(r) 
points. The number of points over which it is possible to 
compute this quantity at the scale r, denoted as M{r), de- 
pends on the geometrical constraints of the sample as we 
consider only spheres that are fully enclosed h i the sample 
boundaries (see for details ISvlos Labini et al.l ((2_Q_0_9a) ) . At 
fixed r we make an histogram of the M(r) values of Ni{r), 
which thus represents an estimation of the PDF P{N; r) 
of conditional fluctuations. 

In Figll] we show the behavior of the number density 
as a function of distance. There are two main features: (i) 
a negative slope between 400 Mpc/h < r < 800 Mpc/h 
(i.e., 0.16 < z < 0.28) and (ii) a positive slope up to a local 
peak at r ~ 950 Mpc/h (i.e., z ^ 0.34). Additional fea- 
tures are present as peaks in the n{r) behavior at r 500 
(z ~ 0.18) and r - 650 Mpc/h (z - 0.23). Note that if 
n(z) were constant we would expect a behavior similar 
to the one shown by the mock sample extracted from the 
Horizon simul ation with the s ame geometry of the real 
LRG sample (|Kim et al.l . I2OIOI ) (see FigH]). This implies 
that, by addressing the behavior of n{z) to unknown se- 
lection effects, it is implicitly assumed that the survey has 
lost more than the 20% (for hq = 1.2 x lO"'* galaxies for 
(Mpc/h) ~^ ) of the gal axies for observa t ional problems. 
This looks improbable (jEisenstein et al. . 2001 ) although 
a more careful investigation of the problem must be ad- 
dressed. Note also that the deficit of galaxies would not be 
explained by a smooth redshift -dependent effect ( as it oc- 
curs for z > 0.36 — see Fig.l o fH azin et al.l (|2010l )) rather 
the selection must be strongly redshift dependent as the 
behavior of n(z) is not monotonic. These facts point, but 
do not proof, toward an origin of the n(z) behavior due 
to the intrinsic fiuctuations in the galaxy distribution. 

The analysis of fluctuations by studying n{z) shows 
clearly that the observed behavior is incompatible with 
model predictions. However the redshift distribution pro- 
vides only a rough analysis of fluctuations, especially be- 
cause it is not an average quantity and because it samples 
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Fig. 1. Number density as a function of distance for the 
LRG sample and for a mock sample extracted from the 
Horizon simulations (units are in (Mpc/h) ~^). The blue 
dashed line decays as and it is plotted as reference. 



Fig. 3. PDF for r = 50 Mpc/h in the low (L) redshifts 
and the high (H) redshifts sample. The number of points 
contributing to the histogram is respecitively for L and H 
M (r) = 13277, 13099. 
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Fig. 2. PDF for r = 50,100,150,200 Mpc/h together 
with the best with EqU) The number of points con- 
tributing to the histogram is respecitively M{r) — 
26376,18916,6645,2829. 



different scales differently as the volume in the different 
redshift bins is not the same. For this reason we determine 
the statistical properties of the stochastic variable Ni{r) 
previously defined, which allows us to compute the full 
PDF in sub-samples of equal volume. In Figl^lwe report 
the PDF at different scales in the LRG sample, together 
with the best fit with EqlJl which provides a reasonably 
good fit. A quantification of the difference of the measured 
PDF from the Gumbel or Gaussian behaviors is difficult 
for the problems discussed below. 

Let us now pass to the self-averaging test described in 
Sect 121 To this aim we divide the sample into two non- 
overlapping regions of same volume, one at low (L) red- 
shifts and the other at high (H) redshifts. We then mea- 
sure the PDF Pl{N; r) and Ph{N; r) in the two volumes. 



Given that the total number of points is not very large 
(i.e., M{r) ~ 10**), in order to improve the statistics espe- 
cially for large sphere radii, we allow a partial overlapping 
between the two sub-samples, so that galaxies in the L (H) 
sub-sample counts also galaxies in the H (L) sub-sample. 
This overlapping clearly can only smooth out differences 
between PL{N;r) and PH{N]r). Results are shown in 
FigsHlUfor r = 50, 100, 150, 200 Mpc/h respectively. One 
may note that for r = 50 Mpc/h the two determinations 
are much closer than for lager sphere radii for which there 
is actually a noticeable difference in the whole shape of the 
PDfB The fact that PH{N]r) is shifted toward smaller 
values than PL{N;r) is related to the decaying behavior 
of the redshift counts (see FigUl): most of the galaxies at 
low redshifts see a relatively larger local density than the 
galaxies at higher redshift. 

To clarify the features expected in LCDM models, we 
have done the same analysis in realiz ation of the LRG 
sample from the Horizon mock samples Kim et al. ( 2010l ). 
In FigslTllSlwe report the PDF computed in the full sam- 
ple P{N;r) and in the two half-samples PL{N;r) and 
PH{N;r) as done for the real sample. One may note 
that the PDF is much more regular and has a bell-shape. 
Indeed the Gaussian fit is reasonably good. A complemen- 
tary aspects of these features is provided, as discussed 
above, by the behavior of the radial density that is al- 
most constant but noisy, where in this case fluctuations 
are only due to Poisson noise and radial clustering (see 



^ The Kolmogorov- Smirnoff test (|Press et al.l . |2007| ) allows, 
in general, to make a statistical test to measure the distance 
between two cumulative PDF. In all cases discussed here, we 
find that both the Gaussian and the Gumbel functions do not 
fit the data at a reasonable level of confidence. In addition 
when applying the same test to study whether the data from 
the H and S samples are drawn from the same distribution, we 
also find that the answer is in the negative. 
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Fig. 4. As FigEl but for r 
7929, 7690. 



= 100 Mpc/h. Here M(r) 



Fig. 7. PDF for r ^ 100 Mpc/h in the low (L) redshifts 
and the high (H) redshifts sample for the Horizon mock 
LRG sample. 




Fig. 5. As FigElbut for r = 150 Mpc/h. Here M(r) = 
3495,3150. 
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Fig. 8. PDF for r = 150 Mpc/h in the low (L) redshifts 
and the high (H) redshifts sample for the Horizon mock 
LRG sample. 



FigHJ. These results are clearly expected from a simple 
theoretical analysis: LCDM models are such that non- 
linear clustering occurs only at small scales r < 10 Mpc/h 
and thus the distribution properties at ~ 100 Mpc/h 
are characterized by Gaussianity, small amplitude fluctu- 
ations and weak two-poin t correlations (see discussion in 
Svlos Labini et al.l (|2009al) ). 



Fig. 6. As FiglUbut for r = 200 Mpc/h. Here M(r) 
1465,1354. 



As a final remark, we note that when applying a ran- 
dom selection to the mock catalog, in such a way to get 
the same n{z) of the real data, the results of our tests 
are compatible with the data. Thus, a possible, but as 
mentioned, unsatisfactory, conclusion is that the LRG is 
compatible with the mock catalog. We stress again that 
this conclusion is based on the assumption that an impor- 
tant selection effect, which is not smooth with redshift, 
occurs in the definition of the sample. 
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Fig. 9. Radi al density in the volume lirt iited samples of the 
MGS I'from ISvlos Labini et alJ (l2009al) ') and in the LRG 
sample, the n{z) for the MGS volume limited samples has 
been normalized by taking into account the different se- 
lection in luminosity in the different samples while for the 
LRG units are arbitrary. 



5. Discussion 

We have discussed that the number density as a func- 
tion redshift n{z) of the LRG sample shows, for a wide 
redshift range, a decreasing behavior as a function of red- 
shift, followed by a sharp increase around z ^ 0.36. A 
different behavior was detected by the n(z) of the bright 
galaxies in the MGS. In particular iLovedavl (j2004l) found 
that the number density of bright galaxies increases by a 
factor « 3 as redshift increases from z = 0toz = 0.3. 
This is shown, for instance, by the increase of n{R) up tp 
z ^ 0.2 in the two volume limited sam ples VL4 and VL5 
of the MGS shown in FigIS] (from the Svlos Labini et all 
(|2009al) ). At smaller scales the differential number density 
shows a fluctuating behavior. To explain the rise of the 
differential number density in the deepest samples, which 
contain in average the brightest galaxies of the MGS, a sig- 
nificant evolution in the luminosity and/or numb er density 
of ga laxies at redshifts z < 0.3 as been invoked ( Lovedavl 
20041) . On the other hand, ISvlos Labini et al.l ()2009al ) bv 
measuring the PDF of galaxy counts in spheres, concluded 
that the volume-limited samples of the MGS, extracted 
from the SDSS sixth data release, are characterized by 
the breaking of self-averaging properties for scales r > 30 
Mpc/h and that the increase in the bright galaxies num- 
ber density is interpreted as an effect due to large scale 
inhomogeneities . 

In addition, for galaxies with magn itudes around M* 
it was subsequently shown bv lAntal et a l. (2009) that self- 
averaging properties are verified when the larger volumes 
of the final data release of SDSS is considered. This ac- 
tually poofs that the breaking of self-averaging proper- 
ties is du e to a finite volume effect and n ot to galaxy 
evolution ( Svlos Labini and Barvshev . 2010l ). Moreover in 



the deeper samples, containing the brightest galaxies, self- 
averaging properties were fou nd not to be satisfied even in 
the S DSS final data release (| Svlos Labini and Barvshev . 
20101) . In that case, in agreement with ^Lovedavl (j2004l ). 



it was observed a growth of radial density rather than a 
decrease. In summary, these behaviors support the con- 
clusion that galaxies from the MGS of the SDSS are 
not compatible with spatial homogeneity at scales ~ 100 
Mpc/h. A similar conclusion was drawn from the analysis 
of the 2dFGRS bylSvlos Labini et al.l (I20n9cl ldl) . Recently 



Yang and Saslawl (|2010l ) found that cosmic variance in the 



SDSS causes the counts-in-cells distributions in different 
quadrants to differ from each other by up to 20%, a result 
which corroborate the results discussed here. 

We note that the same systematic effects discussed 
for the behavior of conditional fluctuations similarly af- 
fect the determinations of the two-point correlation func- 
tion and clearly the detection of the so-called baryon 



2005: Kazin et al 



acoustic oscillations (lEisenstein et al 
2OIOI : iMartfnez. et al.l . l2009l) ~ It was indeed shown by 



Svlos Labini et a'Lr ( 2009el ) that major systematic effects 



are present in the estimators of ^ (r) and that the variance 
estimated from the sample itself is much larger than the 
one deduced from the analysis of mock catalogs, making 
the detection of the baryon acoustic oscillations statisti- 
cally unreliable. 

Instead of i nvestigating the ori gin of the fluctuating be- 
havior of n(z), Kazin et al. ( 2010l ) focused their attention 
on the effect of the radial counts on the determination of 
the two-point correlation function ^(r). In particular, they 
proposed mainly two different tests to study what is the 
effect of n{z) on the determination of £,{r). The first test 
consists in taking a mock LRG sample, constructed from a 
cosmological N-body simulation of the LCDM model, and 
by applying a redshift selection which randomly excludes 
points in such a way that the resulting distribution has 
the same n(z) of the real sample. Then one can compare 
^(r) o btained in th e origin al mock and in redshift-sampled 
mock. Kazin et al. ( 2010l ) find that there is a good agree- 
ment between the two. This shows that the particular kind 
of redshift-dependent random sampling considered for the 
given distribution, does not alter the determination of the 
correlation function. In other words this shows that, un- 
der the assumption that the observed LRG sample is a 
realization of the mock LCDM simulation, the n{z) does 
not affect the result. However, if we want to test whether 
the LRG sample has the same statistical properties of the 
mock catalog, we cannot clearly proof (or disproof) this 
hypothesis by assuming a priori that this is true. 

The second test consists in computing ^(r) in the 
data, by using a pair-counting method which mea- 
sures two-point correlations by using a random dis- 
tribution as reference and by comparing the number 
of data-data pairs with the number of data-random 



and /or random-random pairs (jPavis and Feeblest 11983 



Landav and Szalavl . ll993l ). Then, to include the variations 



of n{z) in the data one considers a random sample that 
is not a purely Poisson distribution. Instead, the random 
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sample is generated so that it has the same n{z) behavior 
as in the data. This means that in each redshift shell at 
distance z and of thickness Az the distribution is Poisson 
with the same density n(z) as observed. This test shows 
only that a random sample modified in such a way, does 
not alter the dete rmination of ^(r) (as it is found by 
Kazin et al. Instead, it does not show that the 

features in the observed n{z) behavior are taken into ac- 
count by the fact that the random sample has the same 
features in the redshift counts. 

The behavior of the average conditional properties 
is clearly ill-defined as long as self-averaging properties 
are not found to be stable: indeed the average condi- 
tional density is lust the first moment of the PDF. In 
Svlos Labini et al.l (|2009al ) we have discussed that up to 
~ 30 Mpc/h the average conditional density shows ap- 
proximately a ~ behavior, while in Antal et al. (2009|) 
we find that there is an apparent change of slope at larger 
scales where the average conditional density decays as 
approximately up to 100 Mpc/h. A change of 



-0.3 



slope at ~ 30 Mpc/h from ~ r ^ was found bv lHogg et al 



in a preliminary sample of the LRG. However they 
claimed that a clear transition to a flat behavior, signal- 
ing uniformity, was reached at ^ 70 Mpc/h. With the 
present data, which represent an improvement of almost a 
fac tor 3 in vol u me w ith respect to the data considered 
by iHogg et aL ( 2004 ). because of the breaking of self- 
averaging properties, we are not able to make a reliable 
determination of the average conditional density at scales 
of the order of ^ 100 Mpc/h. However, we note that our 
results are not compatible with a transition to uniformity 
at those scales. 



6. Conclusions 

The number density as a funct ion of redshift in t he final 

2010l) shows 



LRG sample from the SDSS (jKazin et al. 



an irregular and mainly decaying behavior which cannot 
be simply explained by a smooth redshift-dependent selec- 
tion. We have discussed that the behavior oin{z) is closely 
related to the properties of fiuctuations in this sample. We 
conclude that these are not self-averaging and thus they do 
not allow us to make a precise statement on the behavior 
of volume average quantities, as the average conditional 
density or the two-point correlation function. Either the 
behavior of n{z) is determined by intrinsic fiuctuations in 
the LRG distribution which are in amplitude and spatial 
extension much larger than the ones expected in LCDM 
models, or there are major observational selection effects 
intervening in the definition of the LRG sample. For this 
reason we conclude that forthcoming galaxy samples, as 
the data from the B aryon Oscillation Spectroscopic Survey 
(IWhite el al.l . l2010ll , with more controlled selection effects, 
will hopefully clarify the situation. 



I also wish to thank David Hogg, Abhilash Mishra and Subir 
Sarkar for discussions and comments. I acknowledge the use of 
the Sloan Digital Sky Survey data ( |http : //www . sdss ■ org[ ) . 
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